首页> 外文OA文献 >Adaptive opponent modelling for the iterated prisoner's dilemma
【2h】

Adaptive opponent modelling for the iterated prisoner's dilemma

机译:针对反复囚徒困境的自适应对手建模

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This paper describes the design of Laran, an intelligent player for the iterated prisoner's dilemma. Laran is based on an evolutionary algorithm, but instead of using evolution as a mean to define a suitable strategy, it uses evolution to model the behavior of its adversary. In some sense, it understands its opponent, and then exploits such knowledge to devise the best possible conduct. The internal model of the opponent is continuously adapted during the game to match the actual outcome of the game, taking into consideration all played actions. Whether the model is correct, Laran is likely to gain constant advantages and eventually win. A prototype of the proposed approach was matched against twenty players implementing state-of-the art strategies. Results clearly demonstrated the claims
机译:本文介绍了Laran的设计,Laran是一个解决被囚徒困境的智能播放器。 Laran基于进化算法,但没有使用进化方法来定义合适的策略,而是使用进化模型来模拟对手的行为。从某种意义上说,它了解对手,然后利用这些知识来设计最佳行为。考虑到所有玩过的动作,在比赛期间不断调整对手的内部模型,以匹配比赛的实际结果。无论模型是否正确,Laran都有可能获得持续的优势并最终获胜。所提议方法的原型与实施最新战略的20名参与者相匹配。结果清楚地证明了要求

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号